probability vector


Estimating Staged Event Tree Models via Hierarchical Clustering on the Simplex

Shoaib, Muhammad, Riccomagno, Eva, Leonelli, Manuele, Varando, Gherardo

arXiv.org Machine Learning

Staged tree models enhance Bayesian networks by incorporating context-specific dependencies through a stage-based structure. In this study, we present a new framework for estimating staged trees using hierarchical clustering on the probability simplex with simplex-based divergences. We conduct a thorough evaluation of several distance and divergence metrics, including Total Variation, Hellinger, Fisher, and Kaniadakis, alongside various linkage methods such as Ward.D2, average, complete, and McQuitty. Our simulation experiments reveal that Total Variation, especially when combined with Ward.D2 linkage, consistently produces staged trees with better model fit, structure recovery, and computational efficiency. We assess performance using the relative Bayesian Information Criterion (BIC) and Hamming distance. Our findings indicate that although Backward Hill Climbing (BHC) delivers competitive outcomes, it incurs a significantly higher computational cost. Total Variation divergence with Ward.D2 linkage, on the other hand, achieves similar performance with far better computational efficiency, making it a more viable option for large-scale or time-sensitive tasks.
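
As an illustration of the pipeline this abstract describes, the following sketch clusters toy probability vectors with Total Variation distance and Ward linkage via scipy. One caveat: scipy's "ward" method nominally assumes Euclidean distances, so feeding it TV distances only approximates the R-style Ward.D2-on-divergence setup the paper evaluates; all data here are synthetic.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster
from scipy.spatial.distance import pdist

# Toy conditional probability vectors: one point on the simplex per tree vertex.
rng = np.random.default_rng(0)
P = rng.dirichlet(alpha=[1.0, 1.0, 1.0], size=10)  # 10 vertices, 3 outcomes

# Total Variation distance between probability vectors: TV(p, q) = 0.5 * sum |p_i - q_i|.
tv = pdist(P, metric=lambda p, q: 0.5 * np.abs(p - q).sum())

# Ward linkage on the TV distance matrix (approximating the paper's Ward.D2 setting).
Z = linkage(tv, method="ward")

# Cut the dendrogram into stages; vertices sharing a label share a stage.
stages = fcluster(Z, t=3, criterion="maxclust")
print(stages)
```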


Credal Deep Ensembles for Uncertainty Quantification

Neural Information Processing Systems

This paper presents an innovative approach to classification tasks called Credal Deep Ensembles (CreDEs), ensembles of novel Credal-Set Neural Networks (CreNets), aiming to improve epistemic uncertainty (EU) quantification in the framework of credal inference.
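
The one-sentence abstract does not specify how CreNets construct credal sets, so the sketch below shows only a generic, hypothetical way to derive interval (lower/upper) class probabilities from ensemble members; the actual CreDEs training procedure differs in that CreNets predict credal sets directly.

```python
import numpy as np

def credal_interval(member_probs):
    """Generic credal-set construction from ensemble predictions:
    elementwise lower/upper class probabilities (illustrative only)."""
    member_probs = np.asarray(member_probs)  # (n_members, n_classes)
    return member_probs.min(axis=0), member_probs.max(axis=0)

probs = [[0.70, 0.20, 0.10], [0.60, 0.30, 0.10], [0.80, 0.15, 0.05]]
lo, hi = credal_interval(probs)
# Interval width is one crude proxy for epistemic uncertainty (EU).
print(lo, hi, (hi - lo).sum())
```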






0b7f639ef28a9035a71f7e0c04c1d681-Supplemental-Conference.pdf

Neural Information Processing Systems

For DM, due to high memory requirements, we were able to go up to a BatchEnsemble with an ensemble size of 8, while being able to use only a batch size of 32. In addition, for this baseline we used a bigger-memory GPU, as we were unable to fit the training on our standard 11 GB GPU used for the rest of our experiments. In the procedure of creating a Mixup [8] auxiliary dataset, we used a Beta distribution with α = 0.2. In Mixup augmentation, a value λ ∈ [0, 1] is sampled from a Beta distribution. We use a batch size of 64.
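
For reference, the Mixup step described here (λ drawn from Beta(α, α) with α = 0.2, then a convex combination of inputs and labels) can be written in a few lines; the function below is a minimal sketch, not the authors' code.

```python
import numpy as np

def mixup(x1, y1, x2, y2, alpha=0.2, rng=None):
    """Mixup: sample lam ~ Beta(alpha, alpha) and convex-combine two
    examples and their (one-hot) labels."""
    rng = rng or np.random.default_rng()
    lam = rng.beta(alpha, alpha)  # lam lies in [0, 1]
    return lam * x1 + (1 - lam) * x2, lam * y1 + (1 - lam) * y2
```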


DATA: Differentiable ArchiTecture Approximation

Neural Information Processing Systems

Neural architecture search (NAS) is inherently subject to the gap between the architectures used during searching and validating. To bridge this gap, we develop Differentiable ArchiTecture Approximation (DATA) with an Ensemble Gumbel-Softmax (EGS) estimator to automatically approximate architectures during searching and validating in a differentiable manner. Technically, the EGS estimator consists of a group of Gumbel-Softmax estimators, which is capable of converting probability vectors to binary codes and passing gradients from the binary codes back to the probability vectors. Benefiting from such modeling, during searching, architecture parameters and network weights in the NAS model can be jointly optimized with standard back-propagation, yielding an end-to-end learning mechanism for searching deep models in a sufficiently large search space. Consequently, during validating, a high-performance architecture that approaches the one learned during searching is readily built. Extensive experiments on a variety of popular datasets provide strong evidence that our method is capable of discovering high-performance architectures for image classification, language modeling and semantic segmentation, while guaranteeing the requisite efficiency during searching.
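
A minimal PyTorch reading of the EGS idea follows: each Gumbel-Softmax draw with hard=True emits a one-hot "binary code" in the forward pass while gradients flow through the soft sample (the straight-through trick), and several draws are then aggregated. How the paper aggregates its group of estimators is an assumption here (a plain average).

```python
import torch
import torch.nn.functional as F

def ensemble_gumbel_softmax(logits, n_estimators=4, tau=1.0):
    # hard=True yields one-hot "binary codes" in the forward pass while the
    # backward pass flows through the soft sample (straight-through).
    samples = torch.stack([
        F.gumbel_softmax(logits, tau=tau, hard=True)
        for _ in range(n_estimators)
    ])
    return samples.mean(dim=0)  # assumed aggregation: average the group

logits = torch.randn(5, requires_grad=True)  # e.g. 5 candidate operations
code = ensemble_gumbel_softmax(logits)
code.sum().backward()                        # gradients reach the logits
print(code, logits.grad)
```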


MEDIC: a network for monitoring data quality in collider experiments

Bassa, Juvenal, Chattopadhyay, Arghya, Malik, Sudhir, Rivera, Mario Escabi

arXiv.org Artificial Intelligence

Data Quality Monitoring (DQM) is a crucial component of particle physics experiments, ensuring that the recorded data is of the highest quality and suitable for subsequent physics analysis. Due to the extreme environmental conditions, unprecedented data volumes, and the sheer scale and complexity of the detectors, DQM orchestration has become a very challenging task. The use of Machine Learning (ML) to automate anomaly detection, improve efficiency, and reduce human error in the process of collecting high-quality data is therefore unavoidable. Since DQM relies on real experimental data, it is inherently tied to the specific detector substructure and technology in operation. In this work, a simulation-driven approach to DQM is proposed, enabling the study and development of data-quality methodologies in a controlled environment. Using a modified version of Delphes -- a fast, multi-purpose detector simulation -- we demonstrate the preliminary realization of a framework that leverages ML to identify detector anomalies and localize the malfunctioning components responsible. We introduce MEDIC (Monitoring for Event Data Integrity and Consistency), a neural network designed to learn detector behavior and perform DQM tasks by looking for potential faults. Although the present implementation adopts a simplified setup for computational ease, in which large detector regions are deliberately deactivated to mimic faults, this work represents an initial step toward a comprehensive ML-based DQM framework. The encouraging results underline the potential of simulation-driven studies as a foundation for developing more advanced, data-driven DQM systems for future particle detectors.
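
Since the abstract leaves MEDIC's architecture unspecified, the snippet below is a purely hypothetical stand-in illustrating the task setup described: classify, from a detector occupancy map, which coarse detector region (if any) has been deactivated.

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for a MEDIC-style DQM network: predict which coarse
# detector region (if any) is dead from a calorimeter occupancy map. The real
# MEDIC architecture and inputs are not specified in the abstract.
class DQMNet(nn.Module):
    def __init__(self, n_regions=8):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(4), nn.Flatten(),
            nn.Linear(16 * 16, n_regions + 1),  # +1 class for "no fault"
        )

    def forward(self, occupancy):  # shape: (batch, 1, eta_bins, phi_bins)
        return self.net(occupancy)
```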


A Black-Box Debiasing Framework for Conditional Sampling

Cui, Han, Liu, Jingbo

arXiv.org Machine Learning

Conditional sampling is a fundamental task in Bayesian statistics and generative modeling. Consider the problem of sampling from the posterior distribution $P_{X|Y=y^*}$ for some observation $y^*$, where the likelihood $P_{Y|X}$ is known and we are given $n$ i.i.d. samples $D=\{X_i\}_{i=1}^n$ drawn from an unknown prior distribution $\pi_X$. Suppose that $f(\hat\pi_{X^n})$ is the distribution of a posterior sample generated by an algorithm (e.g. a conditional generative model or the Bayes rule) when $\hat\pi_{X^n}$ is the empirical distribution of the training data. Although, averaging over the randomness of the training data $D$, we have $\mathbb{E}_D\left(\hat\pi_{X^n}\right)= \pi_X$, we do not have $\mathbb{E}_D\left\{f(\hat\pi_{X^n})\right\}= f(\pi_X)$ due to the nonlinearity of $f$, leading to a bias. In this paper we propose a black-box debiasing scheme that improves the accuracy of such a naive plug-in approach. For any integer $k$, and under boundedness of the likelihood and smoothness of $f$, we generate samples $\hat{X}^{(1)},\dots,\hat{X}^{(k)}$ and weights $w_1,\dots,w_k$ such that $\sum_{i=1}^k w_i P_{\hat{X}^{(i)}}$ is a $k$-th order approximation of $f(\pi_X)$, where the generation process treats $f$ as a black box. Our generation process achieves higher accuracy when averaged over the randomness of the training data, without degrading the variance, which can be interpreted as improving memorization without compromising generalization in generative models.
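
To make the bias concrete: under the stated smoothness, a Taylor expansion of $f$ around $\pi_X$ suggests the plug-in carries an $O(1/n)$ bias, and the classical Richardson/jackknife combination below cancels the leading term, which is one way to read the $k=2$ case. The weights $(2, -1)$ are a textbook illustration, not necessarily the paper's construction.

```latex
\mathbb{E}_D\, f(\hat\pi_{X^n})
  = f(\pi_X) + \frac{c_1(f,\pi_X)}{n} + O(n^{-2}),
\qquad
\mathbb{E}_D\!\left[\, 2 f(\hat\pi_{X^n}) - f(\hat\pi_{X^{n/2}}) \,\right]
  = f(\pi_X) + O(n^{-2}),
```

where $\hat\pi_{X^{n/2}}$ denotes the empirical distribution of half the training sample.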